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Abstract 

The objective of the consistent-amplitude approach to quantum theory 
has been to justify the mathematical formalism on the basis of three main 
assumptions: the first defines the subject matter, the second introduces 
amplitudes as the tools for quantitative reasoning, and the third is an 
interpretative rule that provides the link to the prediction of experimental 
outcomes. In this work we introduce a natural and compelling fourth 
assumption: if there is no reason to prefer one region of the configuration 
space over another then they should be 'weighted' equally. This is the 
last ingredient necessary to introduce a unique inner product in the linear 
space of wave functions. Thus, a form of the principle of insufficient 
reason is implicit in the Hilbert inner product. Armed with the inner 
product we obtain two results. First, we elaborate on an earlier proof 
of the Born probability rule. The implicit appeal to insufficient reason 
shows that quantum probabilities are not more objective than classical 
probabilities. Previously we had argued that the consistent manipulation 
of amplitudes leads to a linear time evolution; our second result is that 
time evolution must also be unitary. The argument is straightforward 
and hinges on the conservation of entropy. The only subtlety consists of 
defining the correct entropy; it is the array entropy, not von Neumann's. 
After unitary evolution has been established we proceed to introduce the 
useful notion of observables and we explore how von Neumann's entropy 
can be linked to Shannon's information theory. Finally, we discuss how 
various connections among the postulates of quantum theory are made 
explicit within this approach. 

1 Introduction 

Quantum theory is a set of rules for reasoning in situations where even under 
optimal conditions the information available to predict the outcome of an ex- 
periment may still turn out to be insufficient. This explains why the notion 
of probability plays such a central role and immediately raises a number of 
interesting questions. 

One such question is whether these quantum probabilities differ in any es- 
sential way from ordinary classical probabilities. It is sometimes argued that 
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there is an element of subjectivity in the nature of classical probabilities that 
is not shared by quantum probabilities, that the latter are totally objective be- 
cause they are given by the Born rule, that is, by the modulus squared of the 
wave function. One of the purposes of this paper is to support the opposite 
point of view: we will argue that the probabilities assigned using the Born rule 
are neither more nor less subjective than say the probability 1/6 assigned to 
each face of a die when there is no reason to favor one face over another. We 
will show that there is a form of the principle of insufficient reason implicitly 
encoded into the usual postulates of quantum theory. 

A second question is the following. One would expect that if predictions are 
to be made on the basis of insufficient information then quantities that measure 
the amount of information, entropies, should play a central role P]]|||}]. Re- 
markably one finds that while the notion of entropy is indeed extremely useful, 
its use in foundational issues has been very limited Entropy is not 

mentioned in the postulates; it is introduced later either to analyze quantum 
measurements or in statistical mechanics where problems are sufficiently compli- 
cated that clean deductive methods fail and one is forced to use dirtier inference 
methods. A second purpose of this paper is to show that entropic arguments 
are, in fact, implicit in the usual quantum postulates. 

This paper is a continuation of previous work 0[|8| in which quantum the- 
ory is formulated as the only consistent way to manipulate amplitudes. In 
this consistent-amplitude quantum theory (CAQT) amplitudes have a clear in- 
terpretation: they are tools for reasoning that encode information about how 
complicated experimental setups are related to those more elementary setups 
from which they were built. The result of this approach is the standard quantum 
theory || Q| , in a form that is very close to Feynman's jlO| . 

The objective of CAQT has been to justify the mathematical formalism on 
the basis of rather general assumptions in the hope that this would not only 
clarify the formal connections among the various postulates of quantum theory 
but also illuminate the issue of how the formalism should be interpreted. In 
this respect the traditional approach has been to set up the formalism first and 
then try to find out what it all means. This problem of attributing physical 
meaning to mathematical constructs is a notoriously difficult one. So, instead 
of taking the standard quantum theory as axiomatized by, say, von Neumann, 
and then, appending an interpretation to it, the approach we take is to build 
the formalism and its interpretation simultaneously. 

In the brief summary of the CAQT given in section 2 three of the main 
assumptions are explicitly stated. The first concerns the subject matter: quan- 
tum theory is concerned with predicting the outcomes of experiments performed 
with certain setups. The second introduces amplitudes as the tools for quanti- 
tative reasoning, and the third is an interpretative rule that provides the link 
between the mathematical formalism and the actual prediction of experimental 
outcomes. 

It is quite remarkable that although the interpretative rule does not in itself 
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involve probabilities it can be used to prove the Born statistical 'postulate' (or, 
better, the Born 'rule') p| provided one extra ingredient is added. The need 
for this fourth assumption arises because the application of the interpretative 
rule requires a criterion to quantify the change in amplitudes when setups are 
modified. In || the criterion adopted was to use the Hilbert norm as the means 
to measure the distance between wave functions. Such a technical assumption 
without any obvious physical basis clearly detracts from the beauty and cogency 
of the argument. In section 3 this blemish is corrected; we do not remove the 
assumption, we just rewrite it in a form that is physically more appealing and 



suggestive [13 . The main point is that the components out of which setups are 
built, the filters, already supply us with a notion of orthogonality and this takes 
us a long way towards defining an inner product. Thus, instead of a strong 
and unnatural assumption about the Hilbert norm we make a much weaker and 
more natural assumption about the inner product. 

The fourth assumption takes the form of a symmetry argument: if there is 
no reason to prefer one region of the configuration space over another then they 
should be weighted equally. The mere fact that some such assumption is neces- 
sary already has interesting implications. The fact that a form of the principle 
of insufficient reason is implicit in the Hilbert inner product brings quantum 
probabilities closer to their classical counterparts. Quantum probabilities are 
not more objective than classical probabilities. The interpretation of quantum 
mechanics, just like that of other theories of inference, is affected by the issue 
of what probabilities mean. 

Once one finds that time evolution must be linear the obvious next ques- 
tion is whether it must also be unitary. These two issues of linearity and unitar- 
ity are usually considered together (for a short review see e.g. Jl4|). A common 
explanation for the unitarity of time evolution is that it guarantees the conser- 
vation of probabilities. This is true but it is also irrelevant; that probabilities 
should add up to one is true by definition flEI , and any non-conservation of 
probabilities can always be trivially fixed by reinterpreting as a relative 
probability rather than the probability itself. 

Another common explanation based on Wigner's theorem jl6) is also found 
to be inadequate. The idea is to start with a quantum kinematics given by 
a Hilbert space and deduce linear and unitary evolution from the assumption 
that time evolution is a 'symmetry' by which it is meant a transformation that 
preserves orthogonality among states. The question, of course, is why should 
time evolution be a 'symmetry' in this technical sense. In fact, when the as- 
sumption is relaxed one finds, as expected, that the corresponding dynamics is 
non- unitary and irreversible [ p"7[ . 

This suggests yet another approach. It is a matter of definition that entropy, 
as a measure of amount of information, is conserved whenever the information 
available for the prediction of experimental outcomes is not spoiled by the mere 
passage of time. The plan, then, is simple: impose entropy conservation and 



3 



from this deduce unitary time evolution. There is, however, one remaining 
obstacle: one must identify the correct entropy. The obvious candidate, von 
Neumann's entropy, fails. The problem is that the interpretation, the very 
meaning of von Neumann's entropy, is derived in the context of a linear quantum 
theory that is already assumed to be unitary B. Therefore, arguments based 
on von Neumann's entropy are circular. 

The argument we offer in section 4 is based on the idea of array entropy, a 
concept that was briefly introduced by Jaynes jt8| only to be dismissed as an 
inadequate candidate for the entropy of a quantum system, a quantity which he 
identified with von Neumann's entropy From the point of view of the CAQT, 
however, amplitudes and wave functions are assigned not just to the system 
but to the whole experimental setup, and this turns the array entropy into a 
legitimate entropy for our purpose. Its conservation implies the conservation 
of the Hilbert norm and unitary evolution. As claimed above, the notion of 
entropy plays an important role at the foundations of quantum theory; it is 
implicit in the postulate that time evolution is unitary. 

Up to this point the discussion has been about experiments involving ide- 
alized detectors localized at a given point in the configuration space. In the 
traditional language the only observable measured is position. In section 5 we 
address the issue of how observables other than position make their appearance 
within the CAQT approach. We find that these observables are useful concepts 
in that they facilitate the description of complex experiments but, from our 
point of view, they are of only secondary importance and play no role at the 
foundational level. 

The prominence awarded within the CAQT to the concept of array entropy 
stems partly from our choice of subject matter - experimental setups rather 
than quantum systems - and partly from the fact that it is the array entropy 
that provides the link between the Shannon information theory entropy and 
von Neumann's entropy. The short discussion in section 6 shows two ways to 
introduce von Neumann's entropy. This is an adaptation of the arguments of 
Jaynes jl8| and of Blankenbecler and Partovi Q . 

We conclude in section 7 with a summary of our results and a discussion of 
how various relations and connections among the postulates of quantum theory 
are made explicit and clarified within the CAQT approach. 

2 The consistent-amplitude approach to quan- 
tum theory 

We proceed in several steps; effectively, each step consists of making an assump- 
tion and then exploring its consequences. The first and most crucial assumption 
is a decision about the subject matter. What problem is quantum mechanics 
trying to solve? We choose a pragmatic, operational approach: statements 
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about a system are identified with those experimental setups designed to test 
them j?J H . Our strategy is to establish a network of relations among setups in 
the hope that information about some setups might be helpful in making pre- 
dictions about others. We find that there are two basic kinds of relations among 
setups, which we call and and or. These relations or operations represent our 
idealized ability to build more complex setups out of simpler ones, either by 
placing them in "series" or in "parallel" . 

Let us be more specific. To avoid irrelevant technical distractions we consider 
a very simple system, a particle that lives on a discrete lattice and has no spin 
or other internal structure. The generalization to more complex configuration 
spaces should be straightforward. The simplest experimental setup, denoted 
by [xf,xi\, consists of placing a source that prepares the particle at a space- 
time point Xi — (xi,ti) and placing a detector at Xf = (xf,tf). To test a 
more complex statement such as "the particle goes from Xi to x\ and from 
there to Xf, n denoted by [xf,xi,Xi], requires a more complex setup involving 
an idealized device, a "filter" which prevents any motion from xi to Xf except 
via the intermediate point x\. This filter is some sort of obstacle or screen that 
exists only at time t\, blocking the particle everywhere in space except for a 
small "hole" around x\. The possibility of introducing many filters each with 
many holes leads to allowed setups of the general form 

a= [xf,SN,SN-l,---,S2,Sl,Xi] , (1) 

where s n = (x n ,x' n ,x'^, . . .) is a filter at time t n , intermediate between U and 
tf, with holes at x n , x' n , x", ■ ■ ■ 

The first basic relation among setups, which we call and, arises when two 
setups a and b are placed in immediate succession resulting in a third setup 
denoted by ab. It is necessary that the destination point of the earlier setup 
coincide with the source point of the later one, otherwise the combined ab is not 
allowed. The second relation, called or, arises from the possibility of opening 
additional holes in any given filter. Specifically, when (and only when) two 
setups a' and a" are identical except on one single filter where none of the holes 
of a' overlap any of the holes of a" , then we may form a third setup a, denoted 
by a' V a" , which includes the holes of both a' and a". Provided the relevant 
setups are allowed the basic properties of and and or are quite obvious: or is 
commutative, but and is not; both and and or are associative, and finally, and 
distributes over or. We emphasize that these are physical rather than logical 
connectives. They represent our idealized ability to construct more complex 
setups out of simpler ones and they differ substantially from their Boolean and 
quantum logic counterparts. In Boolean logic not only and distributes over or 
but or also distributes over and while in quantum logic propositions refer to 
quantum properties at one time rather than to processes in time. 

The identification of the and/ or relations, as well as their properties (as- 
sociativity, distributivity, etc.) is crucial to defining what kind of setups we 
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arc talking about and therefore crucial to establishing the subject of quantum 
mechanics. Thus, our first assumption is 

Al. The goal of quantum theory is to predict the outcomes of experiments 
involving setups built from components connected through and and or. 

It is important to emphasize that this quantum theory coincides with the stan- 
dard Copenhagen quantum theory (see The contribution, at this point, 
has been to make explicit the relations and/or which are normally implicit in 
the Feynman approach [flo|| . 

The next step involves an assumption about the existence of a quantitative 
tool to handle these relations and/ or: 

A2. A mathematical representation of and/ or is established by the consistent 
assignment of a single complex number (f>(a) to each setup a in such a 
way that relations among setups translate into relations among the corre- 
sponding complex numbers. 

What gives the theory its robustness, its uniqueness, is the requirement that the 
assignment be consistent. If there are two different ways to compute the single 
number <p(a) is assigned to setup a the two answers must agree. The remarkable 
consequence of this consistency constraint is the 'regraduation' theorem || that 
all such representations are equivalent: changing representations involves mere 
changes of variables. Thus, one can always 'regraduate' <j){a) with a function ip 
to switch to an equivalent and more convenient representation, ip{a) = ip((f){a)), 
in which and and or are respectively represented by multiplication and addition. 
Explicitly, ip (ab) = ip (a) tp (b) and ip (a V a') = ip (a) + ip (a 1 ). Anticipating the 
important role played by these conveniently assigned complex numbers we call 
them by the suggestive name of 'amplitudes'. These amplitudes have a clear 
meaning, they are tools for reasoning quantitatively and consistently about the 
relations and/ or. For an earlier derivation of the quantum sum and product 
rules see rcf. [^o|. For comments on the possibility that such a representation 
of and /or might not exist see ref. [ [Hill . 

The observation that a single filter that is totally covered with holes is equiv- 
alent to having no filter at all leads to the fundamental equation of evolution. 
The idea is expressed by writing the relation among setups 

[ x f,Xi] = V ([xf,Xt}[x t ,Xi}) (2) 

all x at t 

in terms of the corresponding amplitudes ||io|| . Using the sum and product rules, 
we get 

ip(xf,Xi)= ^ i>{xf,xt)i){x t ,Xi) . (3) 

all x at t 

Following Feynman [[to] , we introduce the wave function \t(x, t) as the means 
to represent those features of the setup prior to t that are relevant to time 
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evolution after t. Notice that there are many possible combinations of starting 
points Xi and of interactions prior to the time t that will result in identical 
evolution after time t. What these different possibilities have in common is that 
they all lead to the same numerical value for the amplitude ipixt, xt). Therefore 
we set ^{x, t) — ip(x t ,Xi) and all reference to the irrelevant starting point X4 is 
omitted. The traditional language is that \& describes the state of the particle 
at time t, that the effect of the interactions was to prepare the particle in state 
"J . Now we see that the word "state" just refers to a concise means of encoding 
information about the preparation procedure prior to time t that is relevant for 
the evolution after time t. 

The equation of evolution (|^) can then be written as 

= ^{xf,tf,S,t)^(x,t), (4) 

all x at t 



which is equivalent to a linear Schrodinger equation as can easily be seen 




by differentiating with respect to tf and evaluating at tf = t. Thus, a quantum 
theory formulated in terms of consistently assigned amplitudes must be linear; 
nonlinear modifications of quantum mechanics must violate assumptions Al or 
A2 else be internally inconsistent ]is|| . 

The question of how amplitudes or wave functions are used to predict the 
outcomes of experiments is addressed through the time evolution equation (||). 
For example, suppose the preparation procedure is such that ^(x, t) vanishes 
at a certain point xq. Then, according to eq. (||), placing an obstacle at the 
single point (xo,t) (i.e., placing a filter at t with holes everywhere except at 
xq) should have no effect on the subsequent evolution of 'P. Since relations 
among amplitudes are meant to reflect corresponding relations among setups, 
it is natural to assume that the presence or absence of the obstacle will have no 
effect on whether detection at xj occurs or not. Therefore when ^>(x ,t) = 
we predict that the particle will not be detected at (xq, t). This assumption can 
be generalized to the following general interpretative rule: 

A3. If the introduction at time t of a filter blocking those components of the 
wave function characterized by a certain property V has no effect on the 
future evolution of a particular wave function ^S(t) then when the wave 
function happens to be ^ (t) the property V will not be detected. 

The application of this rule requires a means to quantify the difference between 
wave functions before and after a filter. In ref. Q we showed how the interpreta- 
tive rule A3 implies the Born postulate provided this difference is measured by 
a Hilbert norm. In the next section we justify this choice as being the uniquely 
natural one. 
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3 The Hilbert inner product 



In order to justify the use of the Hilbert norm we show how the concepts of 
distance and angle among states, that is an inner product, can be physically 
motivated. The argument has three parts. 

First, we note that wave functions form a linear space. To illustrate this point 
suppose that ^i(x, t) = tp(x, t; x\, to) is the wave function at time t of a particle 
that at time to was prepared at the point x\ , and ^2{x, t) = ip(x, t; X2, to) is the 
wave function at time t of a particle that at time to was prepared at the point 
x%. It is easy to prepare linear superpositions of ^&i(x, t) and ^2(x, t) by placing 
the original source of the particle at an initial point (2^, ti) with U earlier than 
to and letting the particle evolve through a filter at to with two holes, one at x\ 
and the other at x 2 - Then the amplitude ij){x,t]Xi,ti) is 

ip(x,t;xi,ti) = ip(x,t;xi,to)ip(xi,t ;Xi,ti) + ip(x , t; xi, t )ip(xi, t ; £*, U) , (5) 

and, in an obvious notation, the wave function at time t is given by the super- 
position 

= a*i(x,t) +j3^ 2 {x,t) . (6) 

Notice that the complex numbers a and j3 can be changed at will either by 
changing the starting point (xj,tj) or by modifying the setup between ti and to 
in any arbitrary way. 

It is interesting that within the CAQT approach there is a deep connection 
between the linearity of the space of wave functions and the linearity of time 
evolution: they both follow from the same sum and product rules, and ulti- 
mately, from consistency. In contrast, within the traditional approach ||||| the 
two forms of linearity are seemingly unrelated; the first is a kinematical feature 
while the second is dynamical. In fact, attempts to formulate non linear variants 
of quantum theory give up the second linearity, that of time evolution, while 
invariably preserving the first [ pT| . 

The second part of the argument is to point out that the basic components 
of setups, the filters, already supply us with a concept of orthogonality without 
invoking any additional assumptions. 

The action of a filter P at time t with holes at a set of points x p is to turn 
the wave function ^>(x) into the wave function 

P*(aO ■ (7) 

p 

Since filters P act as projectors, P 2 — P, any given filter defines two special 
classes of wave functions. One is the subspace of those wave functions such 
as vj/p = pvp that are unaffected by the filter, P* p = vj/ p . The other is the 
subspace of those wave functions that are totally blocked by the filter, such as 
*i_p = (1 — P) 1 ^, for which P^ 1 _ P = 0. We will say that these two subspaces 
are orthogonal to each other. 
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Any wave function can be decomposed into orthogonal components, 

# = P* + (1 -P)W = fp + fi-p . (8) 

A particularly convenient expansion in orthogonal components is that defined 
by a complete set of elementary filters. The filter Pi is elementary if it has a 
single hole at Xi, it acts by multiplying $(x) by Sg^ and the set is complete if 

5> = 1- (9) 

i 

Then _^ 

= J] P*(f ) = J] A< fe^ . (10) 

i i 

where Aj = ^(xi) and for i ^ j the basis wave functions and 5g t gj are 
orthogonal. 

In the third and last step of our argument, as a matter of convenience, we 
switch to the familiar Dirac notation. Instead of writing ^>(x) and 8g g t we shall 
write l^ff) and \i), so that 

i*> = X>i<>. (u) 

i 

The question is what else, in addition to the notion of orthogonality described 
above, is needed to determine a unique inner product. Recall that an inner 
product satisfies three conditions: 

(a) {*|*) Ss with (*|*) = if and only if |W) = 0, 

(b) linearity in the second factor ($|ai\l/i + = ai( < i 5 | v I / i} + 02(^^2) , 

(c) antilinearity in the first factor, ($1*) = (*|$)*. 

Conditions (b) and (c) determine the product of the state |$) = J2j with 
= Yli Aj|i) in terms of the product of \j) with 

(m=J2 B jMj\i) ■ (12) 

i 

The orthogonality of the basis functions 5g y s i is easily encoded into the inner 
product, just set — for i ^ j. But the case i — j remains undetermined, 
constrained only by condition (a) to be real and positive. Clearly, an additional 
ingredient is needed and to find it we reason as follows. 

Suppose, that some prediction is made concerning the detection of the par- 
ticle at Xi when the state is \^f) (eq. |ll|). Consider now another state \ty') 
= J^i Aj|i + fc) obtained from \ $>) by a mere translation. What prediction should 
we make concerning detection at Xi+fc? Since relations among amplitudes are 
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meant to reflect corresponding relations among setups, it seems natural to as- 
sume that the latter prediction should coincide with the former. As we show 
below this is achieved if we set = (i + k\i + k), that is, we choose 
equal to a constant which, without losing generality, we may set equal to one. 
Therefore, 

= <% ■ (13) 

This fixes a unique inner product 

(^) = Y^BtAi, (14) 

i 

and yields the Hilbert norm 

||*|| 2 ^(*|*)=^|^| 2 . (15) 

i 

Thus, we have arrived at the first main result of this paper: the principle 
of insufficient reason enters quantum theory through the inner product. Our 
assumption can in general be stated as 

A4. If there is no reason to prefer one region of configuration space over another 
they should be assigned equal a priori weight. 

One should point out that the symmetry argument invoked here and the usual 
symmetry arguments leading to conservation laws through Noether's theorem 
are of a very different nature. The latter depends strongly on the particular 
form of the Hamiltonian, on the dynamics; the former is totally independent of 
the Hamiltonian. 

The deduction of the Born statistical rule now proceeds as in ref . || . Briefly 
the idea is as follows. We want to predict the outcome of an experiment in which 
a detector is placed at a certain Xf. when the system is in state (|ll|). In M we 
showed that the state for an ensemble of N identically prepared, independent 
replicas of our particle is the product 

JV 

\*n) = II l*»> = E A >i ■ ■ ■ \iN)...\h) ■ (16) 

a—l zi...zjv 

Suppose that in the iV-particle configuration space we place a special filter, 
denoted by Pf E , the action of which is to block all components of \^n) except 
those for which the fraction n/N of replicas at Xk lies in the range from / — e 
to / + e. The action of such a filter is represented by the projector 

U+e)N 
n=(f-e)N 
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where the P% are themselves projectors that select those components of \*S?n) 
for which the number of replicas at Xk is exactly n, 



Pn= l*tf>~l*l}*n,n*<n| 



JV 



where n k = ^ 8 k 



(18) 



Next we prepare to apply the interpretative rule: we want to know whether 
for large N the state P^ J^jv) after the filter differs or not from the state \^n) 
before the filter. The distance between Pi \^n) and I^jv), measured by the 



norm, 

\\P^n) - \^n)\\ 2 , 
need not converge as N — > oo, but the relative distance 

p^l^}-!^)' 12 



I* 



(19) 



(20) 



N/ 



does. The calculation is straightforward ||. We first normalize \H>), 

^|^| 2 = l, (21) 



so that (^/atI^at) = 1 and the relative distance (|20|) coincides with (|19|). The 
result is 



|j^ iB i^>-i^>f = i- 

n={f-e)N 



(\A k \ 2 ) n (l - \A k \ 



, N—r 



(22) 



For large N this binomial sum tends to the integral of a Gaussian with mean 
/ = \A k \ 2 and variance a 2 N = f(l — f)/N. In the limit N — > oo this is more 
concisely written as a 8 function. Therefore 



lira 

N—>oo 



\^n)\ 



f+e 



f-e 



\A k \ 2 )df>. 



(23) 



This shows that for large N the filter Pj e has a negligible effect on the state 
|^jv) provided / lies in a range 2e about |Afc| 2 . Therefore, according to the 
interpretative rule A3, the state |^jv) does not contain any fractions outside 
this range. On choosing stricter filters with e — > 0, we conclude that detection 
at x k will occur for a fraction \A k \ 2 of the replicas and that it will not occur for 
a fraction 1 — |Afc| 2 . For any one of the identical individual replicas however, 
there is no such certainty; the best one can do is to say that detection will occur 
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with a certain probability Pr(fc). In order to be consistent with the law of large 
numbers the assigned value must be, 

P r (k) = \A k \ 2 . (24) 

Theoretical arguments always involve idealizations which if taken literally are 
clearly unrealistic. Some, such as the limit N — > oo, are obviously unphysical 
and yet routinely recognized as useful. But in other cases legitimate doubts 
may arise. One may, for example, question whether in invoking filters such as 
Pf e acting on the iV-particle configuration space and selecting wave function 
components with a very peculiar property V the idealizations are being pushed 
too far. While recognizing that attempts to persuade all skeptics are doomed to 
fail, perhaps the following two-dimensional example, borrowed from p3f , may 
be of some help. 

Consider the special case where the single particle state ( pd| ) contains just 
two terms, say \^) = Ai\l) + A 2 \2). This is analogous to a spin 1/2 particle. 
To pursue this analogy imagine the individual spins of the A^-particle ensemble 
are conveniently arranged in a little crystal. This ensemble would have definite 
fractions of particles with spin up (say, |1}) without any individual spin being 
itself definitely up or down. One way to determine this fraction consists of 
sending the little crystal through a suitable Stern-Gerlach device to measure its 
total spin by observing how it is deflected from the original trajectory. The filter 
Pi e is easy to visualize: it is just a slit that allows passage when the deflection 
has the appropriate value. According to the interpretative rule A3 the property 
V detected is that the fraction of particles with spin up is \ A\ | 2 . The state of the 
ensemble \^n) as well as the state of each individual spin remains unchanged 
in such a measurement. 

But there is a second way to determine the definite value of the fraction of 
spins up; it consists of detecting each individual spin and counting the number 
with spin up. This second method will affect the state of the ensemble as well as 
the state of each individual spin but it is a legitimate way of detecting the same 
property V and whatever the result it should agree with the previous method. 
Since there is a definite fraction of spin up results and a definite fraction of spin 
down results for an individual particle one can make no definite prediction. The 
best we can do is assign a probability \Ax\ 2 that the outcome will be spin up. 

In our case we are concerned with position rather than spin but suitable 
modifications are conceivable; perhaps the required 'Stern-Gerlach' device could 
directly measure the center of mass of the ensemble rather than its total mag- 
netization. 

After this digression, let us return to assumption A4 and further explore 
its implications. Suppose, for example, that the sites of the discrete lattice on 
which the particle 'moves' are unevenly spaced. Then there is no reason to 
give equal weights to different \i)'s. The consequences of choosing a different 
normalization = WiSij are easy to track down: the weighted inner product 
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of |$) = £\ BAj) with |*) = becomes 

($|*) =Y d W i B* i A i , (25) 

i 

the completeness relation (^) becomes 

l = X)Pi = X)i« i - 1 |i><i|. (26) 

and the probability of detection at affc would not be given by the Born rule but 
rather by Pr(fc) = Wk\A k \ 2 . 

An appealing but still arbitrary choice is to weight each cell of the lattice by 

1 /2 

its own volume which we write as Wi = g i Ax. This is particularly interesting 
in the continuum limit Ax — > 0. First, write the completeness relation (^) as 

i = -£sl /2 Ax W M . (27) 

9i &X9i Ax 

On replacing Ax by g 1 / 2 dx and (g^Aa:) -1 |i) by |x) the new completeness 
condition becomes 

J g 1/2 dx\x) (x\ = 1 . (28) 

-> -> 1/2 

Next, replace Sy/Ax by <5(x — af') and the inner product = g i AxSij 

becomes 

= g- 1/2 6{x - x') . (29) 
Furthermore, on replacing Aj by .A (a?), the state |^) = .A, |z) becomes 

i 

I*} = / 3 1/2 ^ A(z?) |f) , (30) 

and the Born rule Pr(z) = Wi|Ai| 2 becomes 

Pr(dx)=g 1/2 dx\A(x)\ 2 . (31) 

As expected, |A(:c)| 2 is the probability density. These results apply to situa- 
tions where the homogeneity of space is hidden by an inconvenient choice of 
coordinates, and also to intrinsically inhomogeneous, curved spaces. 

We see that the Born rule follows, even in curved spaces, from giving the 
same a priori weight, the same preference, to spatial volume elements that are 
equal. This is a perhaps unexpected connection between quantum theory and 
the geometry of space and one suspects that it is not accidental. It is tempting 
to invert the logic and assign equal volumes to spatial regions that are equally 
preferred. This would explain what a physical volume is: just a measure of a 
priori preference. The full implications of these remarks remain to be explored. 
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4 Array entropy and unitary time evolution 



In a situation of optimal information everything that is relevant about the ex- 
perimental setup prior to time t = is known, then the wave function ^(x, 0) 
is known. But if less information is available perhaps the best we can do is con- 
clude that the actual preparation procedure was one among several possibilities 
a = 1, 2, 3, ... each one with a certain probability p a . For simplicity we initially 
assume these possibilities form a discrete set. The usual linguistic trap is to 
say the system is in state ^ a {x, 0) with probability p a , but it is better to say 
that the preparation procedure is ^ a (x, 0) with probability p a [ ^5| . To this state 
of knowledge, which one may represent as a set of weighted points in Hilbert 
space, and which Jaynes referred to as an array fis|[p6|| , one can associate an 
entropy, called the array entropy 



A valid objection to using this quantity as the entropy of the quantum system 
is that if the ^ a (x, 0) are not orthogonal then the p a are not the probabilities 
of mutually exclusive events. When regarded as a property or an attribute 
of the quantum system the various ^ a (x, 0) need not, in fact, be mutually 
exclusive; if ( 1 I r Q ,|\I f i g) ^ 0, knowing that the system is in \& a (x, 0) does not 
exclude the possibility that it will be found in ^ p{x, 0). However, if the ^ a (x, 0) 
are attributes of the preparation procedure then they are mutually exclusive 
because the preparation devices are macroscopic! Sa is a useful concept when 
interpreted as the entropy of the whole setup and not as the entropy of the 
quantum system. 

The importance of this conceptual point cannot be overemphasized and a 
more explicit illustration may clarify it further. Consider a spin 1/2 particle 
prepared either with spin along the z direction or with spin along the x direction. 
These states are not orthogonal and by 'looking' at the particle there is no sure 
way to tell which of the two alternatives holds, and yet nothing prevents one 
from looking directly at the macroscopic Stern-Gerlach magnets. This will reveal 
which of the two mutually exclusive orientations was used without affecting the 
wave function. One can distinguish non-orthogonal states by looking at the 
macroscopic devices that prepared the system rather than by looking at the 
system itself. 

Turning to the issue of time evolution, we consider situations where those 
parts of the setup after time are known and no further uncertainty is intro- 
duced. Under these conditions the points of the new array are shifted from 
\I/ a (x, 0) to \I/ Q (x, t) but their probabilities p a and the corresponding array en- 
tropy Sa remain unchanged. 

The uncertainty discussed in the previous paragraphs led to a probability 
distribution defined over a discrete array but, in general, we may have to deal 




(32) 



a 
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with a continuous array. This is of considerable significance for the issue of time 
evolution. 

The simplest continuous array is one dimensional, a weighted curve C in 
Hilbert space. We could consider higher dimensional arrays but this would un- 
necessarily obscure the argument that follows. The reparametrization- invariant 
entropy of this continuous array is p3 



where p{a)da is the probability that the preparation procedure lies in the 
interval between a and a + da and £(a)da is a measure of the distance in 
Hilbert space between ^ a (x,0) and ^/ a+ d a (x , 0) . As discussed in the last sec- 
tion the Hilbert norm is the uniquely natural choice of distance, thus £(a)da = 



Again we consider setups for which no further uncertainty is introduced 
between times and t. We find that points \1/ Q (x, 0) of the old line array at 
t = will move to points ^ a (x, t) to form a new line array at time t. Since 
no information was lost between times and t we expect that, just as in the 
discrete case, the probabilities p{a)da remain unchanged and the corresponding 
array entropy Sa is conserved. But entropy conservation, 



should hold for any curve C and any function p(a), therefore 



The conservation of the array entropy leads to the conservation of Hilbert space 
distances. Since linear transformations that preserve the Hilbert norm are called 
unitary we conclude that time evolution is given by a unitary transformation. 
The Hamiltonian must be Hermitian; energy eigenvalues are real. 
In the argument above it is implicit that 

A' 5. The experimental setups about which we wish to make predictions involve 
no loss of information. 

This assumption is of a somewhat different nature than the previous ones - 
thus the prime. Since the objective of A'5 is to specify more precisely what 
are the experimental setups we are dealing with, A'5 is in effect contributing to 
define the subject of quantum theory. It may, therefore, be more appropriate 
to include A'5 as part of Al. On the other hand, one can also make the case 
that A'5 is already implicit in A2: it is only to those setups that have been 
optimally specified that one can assign a single complex number. In any case, 
the purpose of A'5 is to make explicit that in these setups entropy must be 
conserved. 




(33) 



|||* a+dQ )-|*a)ll- 




(34) 
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5 Observables other than position 



The experiments we have discussed involve position detectors. One could say 
we have only considered position 'measurements', but this usage of the word 
'measurement' requires some caution. The problem is that it suggests that 
before the 'measurement' the particle had a position, the value of which, albeit 
unknown, was very definite. This is an assumption that need not and should not 
be made; statements about whether the particle has a position or not should 
be avoided. These statements are not identifiable with experimental setups, 
and according to our assumption Al, they are foreign to the subject matter of 
CAQT; they are not even wrong, they are meaningless. What has a definite 
position is the detector, not the particle f27j| . 

The issue we address next concerns other observables, how they are 'mea- 
sured' and what role they play. 

To build more complex detectors one can modify the setup just prior to the 
final position detection at Xf by introducing, for example, additional magnetic 
fields or diffraction gratings. Suppose that the setup prior to time t prepares 
the system in a certain state. After time t the time evolution will in general be 
given by eq. (^) but suppose that interactions between the time t and the time 
of detection t j are arranged in such a way that if the wave function happened to 
be the function ^j(x, t) then at the later time tf the new wave function <f)j(x, tf) 
would vanish everywhere except at Xj , 

<Pj(nf,tf) = J2^(xf,t f ;x,t)<i> j (x,t) = 8 Sft3j . (36) 

all x 

In this special case the particle would be detected at Xj with certainty and we 
would say that "at time t / the particle was found at Xj" . Alternatively, we could 
describe this same result and convey additional relevant information about the 
setup by saying that "at time t the particle was found in state $j(x, t)". Thus, 
the latter form of speech, although somewhat inappropriate, has the virtue of 
being more informative. 

The generalization is straightforward: arrange interactions so that each state 
&j (x, t) of a complete and orthogonal set is made to evolve to a corresponding 
state cj)j(x,tf) — Sg_Sj- The wave function at time t can be expanded 

*(f,t) = 5^a i $ J -(f J t), (37) 

j 

and this evolves to 

*(*,*/) =X>J (38) 

i 

Invoking the Born rule we interpret this as "the probability that at time t / the 
particle is found at Xj is |aj| 2 ," or alternatively, and somewhat inappropriately, 
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that "the probability that at time t the particle is found in state <E>., (x, t) is 




What this particular complex detector 'measures' is all observables of the 
form Q = J2 n / ra |$ ra )($n| where the /„ are arbitrary scalars. 

Notice that unitary evolution is a crucial requirement. In order for the expan- 
sion ( p7| ) to be unique the states $ n (x, t) must form a complete and orthogonal 
set which itself must evolve to the also orthogonal set of <fi n (x,tf) — <%2 n . The 
orthogonality must be preserved. One cannot introduce this notion of observ- 
ables until after the issue of unitary time evolution has been settled. 

Notice also that it is not necessary that the operator Q have real eigenvalues, 
but it is necessary that its eigenvectors be orthogonal. This means that the 
Hermitian and anti-Hermitian parts of Q must be simultaneously diagonalizable. 
Thus, while the observable Q does not have to be Hermitian (Q — Q<) it must 
certainly be normal, that is QQ^ = Q^Q. 

It is amusing to reflect that if the sentence "at time t a particle has momen- 
tum p" is used only as a linguistic shortcut that conveys the information that 
the wave function assigned to the setup prior to time t was exp(ip- x/K) then, 
strictly speaking, there is no such thing as the momentum of the particle. The 
point is that wave functions attach to setups and not to particles; whatever p 
is, it is not a property of the particle by itself, but of the whole setup. 

6 von Neumann's entropy 

We saw that the array entropy ( |32| ) is an acceptable measure of uncertainty 
provided it is associated with the whole experimental setup rather than the 
quantum system by itself. This interpretation hinged on the fact that prepa- 
rations are made using macroscopic devices with definite attributes that are in 
principle distinguishable and mutually exclusive even when the corresponding 
wave functions \t a are not orthogonal. 

But suppose that for some unspecified reason the part of the experimental 
setup responsible for the preparation is not directly accessible to observation 
and we can only look at the detectors themselves. This is what happens when 
the actual purpose of the experiment is to obtain information about the prepa- 
ration procedure. Many, maybe most experiments are of this kind. We can, for 
example, detect photons to obtain information about how they were originally 
prepared in a distant star, and thereby we learn about the star; or we can detect 
photons to find how they were prepared at the other end of a communication 
channel, and thereby we receive a message. In these cases a more useful, more 
relevant entropy might be one that measures the uncertainty about how the 
detectors will respond. 

Consider measuring an arbitrary observable Q = ^„/n|^n)($n| m a sit- 
uation where the preparation procedure is uncertain. If the wave function is 
^/^(x, 0) with probability p a the probability that the system is detected in state 
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|$n) is 



P. 



Q = ^2p a \(<Z> n \* a )\ 2 = (<f> n \p\<f> n ), 



(39) 



a 



where p is the density operator 



P = X)Pa|*a)<* 



(40) 



a: 



Thus, knowledge of p allows one to compute the probability of all experimental 
outcomes. An important implication of this result is that if all we can observe 
are the detectors then two different arrays with the same density operator p 
are indistinguishable; they yield experimental outcomes that are statistically 
identical no matter what experiment is performed. To distinguish among such 
arrays requires information which, in practice, is just not available. A second 
important feature is that since p is Hermitian it can be diagonalized, i.e., 



where (wp\w~) — 8^. Therefore, the set of all arrays with the same p includes 
an array that is orthogonal. 

The von Neumann entropy can now be introduced in either of two ways. 
First, we note that two different arrays with the same p need not have the same 
array entropy. What is remarkable is that even though for one array one might 
have a higher uncertainty about the preparation procedure this will not diminish 
our ability to predict the response of the detectors. As far as the detectors are 
concerned the additional uncertainty was irrelevant. The relevant uncertainty 
of all these arrays with the same p is the minimum value that the array entropy 
can attain. It can be shown that the minimizing array is the orthogonal one 
[ fl8| (see also ]28[]). The corresponding entropy is von Neumann's 



Notice that one cannot use the von Neumann entropy introduced in this first 
way to argue that time evolution must be unitary. If no information is dissipated 
one can reasonably expect that the array entropy of an array at time t\ should 
coincide with the array entropy at a later time t 2l but there is no reason to 
expect that the relevant part of these uncertainties should also coincide. In 
other words, a priori there is no reason to assume that it is the orthogonal array 
at ti that evolves into the orthogonal array at t 2 - 

A second way to introduce von Neumann's entropy is to focus attention 
directly on the response of the detectors. The uncertainty about which detector 
will fire when the observable Q is being measured is given by the so called 
measurement entropy 




(41) 




(42) 




(43) 



n 
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with pQ given by ( |39| ) . Notice that even if we have optimal information about 
the preparation procedure, that is, even if p represents a pure state, the mea- 
surement entropy need not vanish - there remains the uncertainty introduced 
by the measurement itself which is given by the Born rule probabilities. This 
indicates that S(p\Q) receives contributions from both the uncertainty in the 
preparation procedure and from the measurement itself. Naturally, the latter 
will depend on the choice of the observable Q. If one seeks a measure of the 
uncertainty in the preparation procedure one should choose that Q which makes 
the least contribution to S(p\Q). The desired observable is p itself J| and the 
corresponding entropy is von Neumann's, 

S vN (p) = mmS(p\Q) = S(p\p) = -Trplogp. (44) 
w 

Notice, again, that one cannot use the von Neumann entropy introduced in 
this second way to argue that time evolution must be unitary. The problem is 
that, as discussed in the previous section, the possibility of measuring arbitrary 
observables Q can only be established after the issue of unitary time evolution 
has been settled. 

To summarize, whichever way one chooses to introduce it, von Neumann's 
entropy represents that component of the uncertainty in the preparation proce- 
dures that is relevant to the response of the detectors. 

7 Final remarks 

The main goal of the CAQT is to justify the formalism of quantum theory on 
the basis of rather general assumptions. An important by-product is that it 
has revealed interesting connections among the various postulates of quantum 
theory. To illustrate this point and, in this context, summarize our main results, 
consider the following standard set of postulates: 

PI The states of a quantum system are represented by elements \ip) in a linear 
space (Pla) with an inner product (P2a), i.e., the are vectors in a 
Hilbert space. 

P2 The time evolution \4>(t)) = U(t) |^>(0)) is given by an operator U(t) which 
is both linear (P2a) and unitary (P2b). 

P3 Every observable A is represented by a Hcrmitian operator A. The out- 
come of a measurement of observable A is one of the eigenvalues a of the 
corresponding operator A, A\a) = a\a). 

P4 The Born postulate: the probability that the measurement of A in a system 
in the normalized state \ip) yields the eigenvalue a is given by |(a|?/')| 2 . 

P5 The projection postulate: after a measurement that yields the eigenvalue a 
the system is left in the eigenstate \a). 
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Consider first a possible connection between PI and P2. The idea that the 
wave function is just a way to codify whatever information is relevant for the 
purpose of making predictions about the future implies that an adequate speci- 
fication of the state will necessarily depend on the nature of the laws ruling time 
evolution. Conversely, deciding on a law of time evolution will depend on what 
it is that is evolving. But this connection between PI and P2 is not explicit 
in the usual approach. For example, both postulates Pla and P2a refer to 
linearity, but these seem to be unrelated, independent linearities. It appears 
possible to give up the dynamical linearity in P2a while preserving the kine- 
matical linearity in Pla. In the traditional approaches to quantum mechanics 
the kinematical aspects of the theory are kept isolated from the dynamical ones. 
In contrast, within the CAQT approach kinematics and dynamics are unified 
into a single structure and, in particular, there is only one linearity, which fol- 
lows from the consistency constraint in the form of the sum and product rules. 
The resulting formalism is more rigid, more robust; small modifications are not 
tolerated. 

The remaining postulates P3, P4 and P5 deal with observations and mea- 
surements. Since these physical processes are themselves ruled by PI and P2, 
it should be the case that the first two postulates already have a lot to say about 
what is and is not observable and what the allowed outcomes of measurements 
should be; parts of P3, P4 and P5 are redundant. We find that P3 and P5 
are redundant except those aspects that refer to experiments involving position 
detectors. Other observables are useful and convenient but not crucial. For 
these observables P3 makes no contribution beyond what is already contained 
in PI and P2. 

We have also found that unitary time evolution P2b and the Born probabil- 
ity rule P4 are linked in yet another way, they both follow from the Hilbert inner 
product Plb which is itself a consequence of a form the Principle of Insufficient 
Reason embodied in our assumption A4. 

The Born probability rule P4 is replaced by a milder and more compelling 
assumption, the general interpretative rule A3 which does not mention probabil- 
ities. From the point of view of the CAQT indetcrminism arises as a consequence 
of our assumption A2 that a single complex number provides an optimal means 
of codifying information about a setup and this information, while optimal, is 
definitely not sufficient. At this point it is still an open question whether more 
information could be codified into a single 'larger' mathematical object (say, a 
matrix) satisfying the associativity and distributivity requirements |2?| |p0[ . In 
any case the mystery remains: why complex numbers? Our assumption A4 has 
made explicit what is an intriguing and perhaps unexpected connection between 
the Hilbert inner product and spatial measures of volume. Perhaps the reason 
for complex numbers will be found in the geometry of space. 

Lastly, we comment on the pragmatic interpretation implicit in the CAQT 
approach. The interpretation and the formalism are irremediably entangled 
form the very beginning. Already from assumption Al the CAQT will make no 
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attempt to offer a model, a description of an underlying objective reality (the 
existence of which is, however, never doubted). Depending on one's prejudices 
this may either be totally unsatisfactory or it may be highly desirable. It incor- 
porates the deep insight of Bohr and Heisenberg that the subject of physics is 
one step removed from reality. Physics does not model reality itself, it models 
information about reality. 

The interpretation of the wave function follows from assumption A2: am- 
plitudes and wave functions are tools for reasoning. It is interesting that since 
*&(t) refers to a certain experiment and ^(t + St) to another, different, albeit 
closely related experiment, it is inappropriate to say that ^(t) has 'moved' to 
\t (t + St). Wave functions 'evolve' in a sense that is somewhat analogous to the 
changes between successive frames in a film; there may be an illusion but there 
is no real motion. Thus the Schrodinger equation is not an equation of motion, 
it is an equation of evolution. Wave functions do not move and consequently 
they do not collapse either. 

While this pragmatic interpretation coincides in its most crucial aspects with 
the Copenhagen interpretation [[fl| there are some differences. There is no need 
to invoke complementary features for the description of the quantum system 
because the CAQT never attempts such a description in the first place. The 
doctrine of complementarity is not needed, and in fact it would represent a step 
in the wrong direction, a half-hearted attempt to peek behind the curtain and at 
least partially describe 'what is really going on'. Similarly, since quantities are 
not associated to the system by itself there is no need to assert that the values of 
certain quantities associated to the system are created by acts of measurement. 
In fact, there is no 'measurement.' 
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